Electronic Companion — “ A Framework for Reconciling Attribute Values from Multiple Data Sources
نویسندگان
چکیده
Proof of Proposition 1. Suppose an attribute value ai is not recorded in any of the data sources S1 through Sn for an entity instance. Then, from Assumptions 1 and 2 in the paper, we have P A= ai AS1 = ak AS2 = al ASn = atW i = k i = l i = t = P AS1 = ak A= ai P AS2 = al A= ai × · · ·×P ASn = at A= ai P AS1 = ak AS2 = al ASn = at P A= ai = 1−R A S1 / m− 1 1−RS2 / m− 1 × · · ·× 1−RSn / m− 1 P AS1 = ak AS2 = al ASn = at P A= ai Clearly, the above probability expression is proportional to the prior probability P A= ai ∀ i. Therefore, we must have P A= ai AS1 = ai AS2 = ai ASn = ai P A= aj AS1 = aj AS2 = aj ASn = aj = P A= ai P A= aj ∀ i j
منابع مشابه
Reconciling Attribute Values from Multiple Data Sources
Because of the heterogeneous nature of multiple data sources, data integration is often one of the most challenging tasks of today’s information systems. While the existing literature has focused on problems such as schema integration and entity identification, our current study attempts to answer a basic question: When an attribute value for a real-world entity is recorded differently in two d...
متن کاملA Framework for Reconciling Attribute Values from Multiple Data Sources
B of the heterogeneous nature of different data sources, data integration is often one of the most challenging tasks in managing modern information systems. While the existing literature has focused on problems such as schema integration and entity identification, it has largely overlooked a basic question: When an attribute value for a real-world entity is recorded differently in different dat...
متن کاملReconciling Continuous Attribute Values from Multiple Data Sources
Because of the heterogeneous nature of different data sources, data integration is often one of the most challenging tasks in managing modern information systems. The challenges exist at three different levels: schema heterogeneity, entity heterogeneity, and data heterogeneity. The existing literature has largely focused on schema heterogeneity and entity heterogeneity; and the very limited wor...
متن کاملConversion Rules from Disparate Data Sources
The successful integration of data from autonomous and heterogeneous systems calls for the resolution of semantic conflicts that may be present. Such conflicts are often reflected by discrepancies in attribute values of the same data object. In this paper, we describe a recently developed prototype system, DIRECT (DIscovering and REconciling ConflicTs). The system mines data value conversion ru...
متن کاملAttribute-based Access Control for Cloud-based Electronic Health Record (EHR) Systems
Electronic health record (EHR) system facilitates integrating patients' medical information and improves service productivity. However, user access to patient data in a privacy-preserving manner is still challenging problem. Many studies concerned with security and privacy in EHR systems. Rezaeibagha and Mu [1] have proposed a hybrid architecture for privacy-preserving accessing patient records...
متن کامل